Greedy Approach to Reliable Disease Susceptibility Prediction
نویسندگان
چکیده
One of the main problems in genetic epidemiology is to robustly predict genetic susceptibility to complex diseases based on the data from case/control studies. This becomes computationally challenging in presence of interactions between multiple genes. In order to efficiently search through enormous amount of possible combinations, it is necessary to apply heuristics and the greedy approach has been successfully validated on many real data. In this paper we modify the genotype covering phase of the model-fitting susceptibility prediction algorithm from [1]. We improve reliability of the previously known prediction method based on greedy approach by replacing clean genotype coverage with dirty coverage, i.e., clusters participating in coverage do not overlap or overlap respectively. We have leave-one/many-out cross-validated existed and proposed prediction methods on real case/control studies of four diseases (Chron’s disease, autoimmune disorder, tick-born encephalitis, and lung cancer). Our results show that relaxation of the clean coverage significantly improves reliability of the greedy based susceptibility prediction approach.
منابع مشابه
Message from the Poster Chairs
One of the main problems in genetic epidemiology is to robustly predict genetic susceptibility to complex diseases based on the data from case/control studies. This becomes computationally challenging in presence of interactions between multiple genes. In order to efficiently search through enormous amount of possible combinations, it is necessary to apply heuristics and the greedy approach has...
متن کاملDiscrete Algorithms for Analysis Of
Accessibility of high-throughput genotyping technology makes possible genomewide association studies for common complex diseases. When dealing with common diseases, it is necessary to search and analyze multiple independent causes resulted from interactions of multiple genes scattered over the entire genome. The optimization formulations for searching disease-associated risk/resistant factors a...
متن کاملCombinatorial Methods for Disease Association Search and Susceptibility Prediction
Accessibility of high-throughput genotyping technology makes possible genome-wide association studies for common complex diseases. When dealing with common diseases, it is necessary to search and analyze multiple independent causes resulted from interactions of multiple genes scattered over the entire genome. This becomes computationally challenging since interaction even of pairs gene variatio...
متن کاملHybrid Method of Logistic Regression and Data Envelopment Analysis for Event Prediction: A Case Study (Stroke Disease)
Abstract Predictive analytics is an area of statistics that deals with extracting information from data and using it to predict trends and behavior patterns. Many mathematical modeling has been developed and used for prediction, and in some cases, they have been found to be very strong and reliable. This paper studies different mathematical and statistical approaches for events prediction. The ...
متن کاملAn Improved Junction-Based Directional Routing Protocol (IJDRP) for VANETs
Vehicular Ad-Hoc Networks (VANETs) is a novel technology that has recently emerged and due to its swift changing topology and high mobility nature, it has become problematic to design an efficient routing protocol in VANETs’ amongst both moving and stationary units. Also, the existing routing algorithms are not very effective to satisfy all requirements of VANETs. This paper explores the need o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007